Unsupervised Aggregation for Classification Problems with Large Numbers of Categories

Authors

  • Ivan Titov
  • Alexandre Klementiev
  • Kevin Small
  • Dan Roth
Abstract

Classification problems with a very large or unbounded set of output categories are common in many areas such as natural language and image processing. In order to improve accuracy on these tasks, it is natural for a decision-maker to combine predictions from various sources. However, supervised data needed to fit an aggregation model is often difficult to obtain, especially if needed for multiple domains. Therefore, we propose a generative model for unsupervised aggregation which exploits the agreement signal to estimate the expertise of individual judges. Due to the large output space size, this aggregation model cannot encode expertise of constituent judges with respect to every category for all problems. Consequently, we extend it by incorporating the notion of category types to account for variability of the judge expertise depending on the type. The viability of our approach is demonstrated both on synthetic experiments and on a practical task of syntactic parser aggregation.
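The idea of using agreement among judges as the only training signal can be illustrated with a small sketch. This is not the generative model proposed in the paper: it is a simplified hard-EM style loop, written under the assumption that each judge has a single accuracy shared across all categories (no category types). The function name `aggregate`, its parameters, and the toy data are hypothetical and chosen only for illustration.

```python
import math
from collections import defaultdict


def aggregate(votes, n_iter=10, smoothing=1.0):
    """Unsupervised aggregation sketch (assumed simplification, not the paper's model).

    votes: list of instances; each instance is a list with one label per judge.
    Returns the consensus labels and one estimated accuracy per judge.
    """
    n_judges = len(votes[0])
    weights = [1.0] * n_judges          # start from an unweighted plurality vote
    consensus, accuracy = [], [0.5] * n_judges
    for _ in range(n_iter):
        # Step 1: pick the label with the highest total judge weight per instance.
        consensus = []
        for labels in votes:
            scores = defaultdict(float)
            for j, label in enumerate(labels):
                scores[label] += weights[j]
            consensus.append(max(scores, key=scores.get))
        # Step 2: re-estimate each judge's accuracy as a smoothed agreement rate
        # with the current consensus, then convert it to a log-odds weight.
        accuracy = []
        for j in range(n_judges):
            agree = sum(1 for labels, c in zip(votes, consensus) if labels[j] == c)
            accuracy.append((agree + smoothing) / (len(votes) + 2 * smoothing))
        weights = [math.log(a / (1.0 - a)) for a in accuracy]
    return consensus, accuracy


# Toy data: three judges, five instances, labels drawn from an open-ended set.
votes = [
    ["a", "a", "b"],
    ["c", "c", "c"],
    ["d", "d", "e"],
    ["f", "g", "f"],
    ["h", "h", "i"],
]
consensus, accuracy = aggregate(votes)
```

Because labels are only compared for equality, the sketch never enumerates the category set, which is what makes this kind of agreement signal usable when the output space is very large or unbounded. Modelling expertise per category type, as the abstract describes, would require replacing the single accuracy per judge with one per type.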


Similar resources

A COGNITIVE STYLE AND AGGREGATION OPERATOR MODEL: A LINGUISTIC APPROACH FOR CLASSIFICATION AND SELECTION OF THE AGGREGATION OPERATORS

Aggregation operators (AOs) have been studied by many scholars. Although many AOs have been proposed, there is still no approach to classify the categories of AO and to select the appropriate AO among the AO candidates. In this research, each AO can be regarded as a cognitive style or individual difference. A Cognitive Style and Aggregation Operator (CSAO) model is proposed to analyze the mapping ...


A robust aggregation operator for multi-criteria decision-making method with bipolar fuzzy soft environment

Molodtsov initiated soft set theory, which provides a general mathematical framework for handling the uncertainties encountered in data by attaching a parameterized factor during information analysis, in contrast to fuzzy as well as bipolar fuzzy set theory. The main object of this paper is to lay a foundation for providing a new application of bipolar fuzzy soft tool in ...


Dennis Sun DATA

Overview: A large number of data analytical procedures are used for the purposes of prediction. The general form of a prediction problem is as follows: Usually, the prediction is to be made in a way that minimizes some objective function error: $\hat{f} = \arg\min_{\hat{f}'} \mathrm{Error}(f, \hat{f}')$. The Error() can be computed differently depending on circumstances. The space of possible predictions $\{\hat{f}'()\}$ may b...


Power harmonic aggregation operator with trapezoidal intuitionistic fuzzy numbers for solving MAGDM problems

Trapezoidal intuitionistic fuzzy numbers (TrIFNs) express abundant and flexible information in a suitable manner and are very useful to depict the decision information in the procedure of decision making. In this paper, some new aggregation operators, such as, trapezoidal intuitionistic fuzzy weighted power harmonic mean (TrIFWPHM) operator, trapezoidal intuitionistic fuzzy ordered weighted po...


A comprehensive experimental comparison of the aggregation techniques for face recognition

In face recognition, one of the most important problems to tackle is a large amount of data and the redundancy of information contained in facial images. There are numerous approaches attempting to reduce this redundancy. One of them is information aggregation based on the results of classifiers built on selected facial areas being the most salient regions from the point of view of classificati...




Publication date: 2010